Multidimensional evaluation and predicting overall speech quality
نویسندگان
چکیده
The quality of speech samples has been traditionally evaluated in subjective listening tests using 5-point Absolute Category Rating (ACR) scales in Listening Only Tests (LOT) as recommended in ITU-T P.800 [1]. Those tests provide the listening quality aspect of speech quality. There are other tests are under discussion and proposed in order to assess in detail individual perceptual dimensions of speech. In this paper we investigate the relationship between the overall listening quality obtained in an ITU-T P.800 ACR subjective test and the rating of the same signals in four dimensions proposed by Wältermann [2], namely noisiness, discontinuity, coloration and loudness. The database we use is composed of conditions and speech signals extracted from an ACR LOT used in the ITU-T P.863 evaluation, processed by simulated and live telecommunication channels [3]. The signals have been re-scored using the four mentioned scales and are foreseen as contribution to the ITU-T P.AMD project. This paper focuses on the modeling of an ACR LOT score based on individual dimensional ratings under the assumption of orthogonality of the four dimensions.
منابع مشابه
Providing a Multidimensional Measurement Model for Assessing Mobile Telecommunication Service Quality (MS-Qual)
Because of the need to develop specific measurement scales for different services industries, this study aimed to empirically develop a reliable and valid model specifically for measuring mobile telecommunication service quality. A multidimensional measurement model (MS-Qual) has been proposed based on an extensive literature review and then, to assess the model validity, convergent and discrim...
متن کاملEvaluation of objective measures for speech enhancement
In this paper, we evaluate the performance of several objective measures in terms of predicting the quality of noisy speech enhanced by noise suppression algorithms. The objective measures considered a wide range of distortions introduced by four types of real-world noise at two SNRs by four classes of speech enhancement algorithms: spectral subtractive, subspace, statistical-model based and Wi...
متن کاملPerceptual Dimensions of Wideband-transmitted Speech
In this paper it is analyzed which perceptual dimensions are existent for speech that is transmitted over wideband telephone connections. Therefore, two auditory experiments with subsequent multidimensional analyses (multidimensional scaling and semantic differential) were carried out with a diverse set of mixed narrowband and wideband conditions. This revealed a mapping of the perceptual space...
متن کاملPerceptual Quality Dimensions of Text-to-Speech Systems
The aim of this paper is to analyze the perceptual quality dimensions of state-of-the-art text-to-speech systems (TTS). Therefore, several pretests were conducted to determine a suitable set of attribute scales. The resulting 16 scales were used in a semantic differential on a diverse database containing 16 different TTS systems. A subsequent multidimensional analysis (Principal Axis Factor ana...
متن کاملListeners' weighting of acoustic cues to synthetic speech naturalness: A multidimensional scaling analysis
The quality of current commercial speech synthesis systems is now so high that system improvements are being made at subtle suband supra-segmental levels. Human perceptual evaluation of such subtle improvements requires a highly sophisticated level of perceptual attention to specific acoustic characteristics or cues. However, it is not well understood what acoustic cues listeners attend to by d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015